GOjen: tdGo Temporal Difference Learning of Go Playing Artificial Neural Networks
Abstract
The original project description was as follows: an existing Java application that handles and visualizes Go games between human and computer players (including trained and evolved artificial neural networks, ANNs) should be improved and extended with Go-playing ANNs trained by temporal difference (TD) learning. This extension should serve as a basis for comparing TD learning with conventional ANN training and evolutionary methods. The ancient Eastern board game Go is known to be highly complex, and conventional computer programs are therefore still far from world-class playing strength. These properties make the game an interesting testbed for new approaches, namely temporal difference learning and ANNs.
Similar resources
Learning to Evaluate Go Positions via Temporal Difference Methods
The game of Go has a high branching factor that defeats the tree search approach used in computer chess, and long-range spatiotemporal interactions that make position evaluation extremely difficult. Development of conventional Go programs is hampered by their knowledge-intensive nature. We demonstrate a viable alternative by training neural networks to evaluate Go positions via temporal differe...
Temporal Difference Learning of Position Evaluation in the Game of Go
The game of Go has a high branching factor that defeats the tree search approach used in computer chess, and long-range spatiotemporal interactions that make position evaluation extremely difficult. Development of conventional Go programs is hampered by their knowledge-intensive nature. We demonstrate a viable alternative by training networks to evaluate Go positions via temporal difference (TD...
Learning to Play Draughts using Temporal Difference Learning with Neural Networks and Databases
This paper describes several aspects of using temporal difference (TD) learning and neural networks to learn game evaluation functions, and the benefits of using databases. Experiments in tic-tac-toe and international draughts were conducted to measure the effectiveness of using databases. The tic-tac-toe experiment showed that training from database games resulted in better play than learnin...
Evaluation in Go by a Neural Network using Soft Segmentation
In this article a neural network architecture is presented that is able to build a soft segmentation of a two-dimensional input. This network architecture is applied to position evaluation in the game of Go. It is trained using self-play and temporal difference learning combined with a rich two-dimensional reinforcement signal. Two experiments are performed, one using the raw board position as ...
The Integration of A Priori Knowledge into a Go Playing Neural Network
The best current computer Go programs are hand-crafted expert systems. They use conventional AI techniques such as pattern matching, rule-based systems and goal-oriented selective search. Due to the increasing complexity of managing this kind of knowledge representation by hand, the playing strength of these programs is still far from human master level. This article describes methods for i...
Publication date: 2004